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Abstract 

An  err or-correct i ng  version  of  tree  automaton  is  studied.  Assume  the  syntax 
errors  on  trees  to  be  of  five  types:  substitution,  stretch,  split,  branch,  and 

deletion.  The  error-correction  problem  is  to  determine  the  distance  between  two 
trees  as  measured  by  the  minimum  cost  sequence  of  error  transformations  needed  to 
transform  one  to  the  other.  Two  types  of  error-correcting  tree  automaton  (ECTA) 
are  proposed:  structure-preserved  ECTA  and  generalized  ECTA.  The  structure-preserved 

ECTA  takes  substitution  errors  into  consideration  only.  A LANDSAT  data  interpretation 
problem  is  used  as  an  example  to  illustrate  the  process  of  structure-preserved  ECTA. 
The  restriction  on  tree  structure  is  removed  in  formulating  the  generalized  ECTA. 

All  five  types  of  syntax  errors  are  considered.  Any  erroneous  tree,  in  this  case,  is 
able  to  find  its  most  similar  (in  terms  of  minimum  distance)  correction  in  the  sense 
of  both  node-labeling  and  tree  structure. 
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1 . Introduction 


In  order  to  tackle  with  the  uncertainty  that  usually  exists  in  the  process  under 
study,  error-correcting  parsing  techniques  have  recently  been  applied  to  the  aiyas  of 
compiling,  computer  communication,  and  syntactic  pattern  recognition  [3,  15,  23,  2U] . 
The  most  widely  used  error-correcting  parsing  scheme  is  formulated  to  Include  substi- 
tution, insertion,  and  deletion  errors  for  str ing-to-string  correction  [28],  The 
basic  approach  is  to  define  these  three  types  of  syntax  errors  in  terms  of  error 
transformations.  The  original  grammar  is  then  expanded  such  that  error  transformations 
on  each  terminal  symbol  has  their  corresponding  terminal  error  productions  [1],  During 
the  error-correct i ng  process,  a decision  criterion  is  required  when  the  parsing  pro- 
cedure faces  multiple  choices  of  the  next  move.  Two  decision  criteria  have  been 
proposed:  the  minimum-distance  criterion,  where  distance  is  measured  in  terms  of  the 

number  of  error  transformations  used  in  a deterministic  model  [1,  21,  23],  and  the 
max i mum- 1 ike  1 i hood  criterion  in  a probabilistic  model  [15,  22,  2^4 ] . 

In  applying  syntactic  methods  to  pattern  recognition,  one-dimensional  (string) 
grammars  are  sometimes  inefficient  in  describing  two-  or  three-dimensional  objects. 

For  the  purpose  of  effectively  describing  high-dimensional  patterns,  high-dimensional 
grammars  such  as  web  grammars,  graph  grammars,  and  tree  grammars  have  been  proposed. 

In  practical  application,  tree  grammars  and  tree  automata  have  been  used  in  the 
classification  of  fingerprint  patterns  [19],  the  analysis  of  bubble  chamber  events  [M  , 

and  the  interpretation  of  NDSAT  data  [17], 

Generalized  finite  automata,  called  tree  automata,  which  accept  finite  trees  of 

symbols  as  its  input,  have  been  studied  by  several  authors  [6,  8,  20,  26],  Brainerd  [6] 
proves  that  the  class  of  systems  which  generate  exactly  the  sets  of  trees  accepted  by 
the  automata  is  a regular  system.  Fu  and  Bhargava  [1^]  first  introduced  the  application 
of  the  tree  systems  into  pattern  recognition.  Later,  as  studied  by  Brayer  and  Fu  [7], 
tree  languages  are  actually  a subset  of  graph  languages  corresponding  to  the  type  3 in 
the  string  language®  case. 
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The  descriptive  power  of  tree  languages  and  the  efficient  analytical  capability 
of  tree  automata  made  the  tree  system  approach  to  pattern  recognition  very  attractive. 
This  paper  is  concerned  with  the  error-correct ing  version  of  tree  automata.  Unlike 
the  string  case,  where  the  only  relation  between  symbols  is  left-right  concatenation, 
a tree  structure  would  be  deformed  under  deletion  or  insertion  errors.  The  structure- 
preserved  error-correcting  tree  automaton  (ECTA)  proposed  in  Section  3 takes  only 
substitution  errors  into  consideration.  By  introducing  a blank  element,  a deletion 
error  can  be  treated  as  substitution  of  a non-blank  element  by  a blank  element,  and 
an  insertion  error  becomes  a non-blank  element  in  substitution  for  a blank  element. 

An  example  of  using  ECTA  in  LANDSAT  data  interpretation  is  presented  in  Section  A. 

In  Section  5,  syntax  errors  on  trees  are  defined  in  terms  of  five  types  of 
error  transformations;  namely,  substitution,  stretch,  branch,  split,  and  deletion. 

The  distance  between  two  trees  is  the  least  cost  sequence  of  error  transformations 
needed  to  transform  one  to  the  other.  By  this  definition,  tree  distance  is  measurable 
between  trees  of  different  structures.  Based  on  the  metric  defined  in  Section  5. 
a generalized  error-correcting  tree  automaton  (GECTA)  is  formulated  in  Section  6. 

The  GECTA  accepts  structurally  distorted  as  well  as  node-mislabeled  trees  such  that 
their  minimum-distance  corrections  can  be  found.  An  example  of  hand-printed 
character  recognition  is  given  in  Section  7 to  demonstrate  the  operation  of  GECTA. 

A discussion  on  the  capability  of  general  error-correcting  parsing  schemes  and  their 
prospects  in  the  area  of  pattern  recognition  can  also  be  found  in  Section  7. 
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2.  Def ini tions 

In  this  section,  some  basic  definitions  on  trees,  tree  grammars,  and  tree 
automata  given  by  Brainerd  [6]  are  briefly  reviewed, 

DEFINITION  2, 1 . Let  N+  be  the  set  of  positive  integers.  Let  II  be  the  free 
monoid  generated  by  N+.  Let  • be  the  operation  and  0 the  identity  of  II.  The 
depth  of  acll  is  denoted  d(a)  and  defined  as  follows:  d(0)=*0,  d(a*  i)=d  (a)  + l , 
ieN+.  a <_  b iff  there  exists  xeU  such  that  a*x=b.  a and  b are  incomparable 
iff  a £ b and  b £ a. 


ala2 


DEFINITION  2,2.  D is  a tree  domain  iff  D is  a finite  subset  of  U satisfying 
(1)  beD  and  a < b implies  asD,  and  (2)  a*jeD  and  i < j in  N+  implies  a»?eD. 
DEFINITION  2,3,  A stratified  alphabet  is  a pair  <E,r>  where  E is  a finite 
set  of  symbols  and  r:E-*-N.  Let  E = r ' (n) . 

DEFINITION  2,1<.  A tree  over  E (i.e.,  over  <E,  r>)  is  a function  a:D-*E,  such 

that  D is  a tree  domain  and  r[a(a)]  = max{ i | a» ? ED} . I.e.,  the  stratification 

of  a label  at  a must  be  equal  to  the  number  of  branches  in  the  tree  domain  at  a. 

The  domain  of  a tree  is  denoted  D (ot)  or  D . Let  T„  be  the  set  of  all  trees  over 

a E 

E.  The  depth  of  a is  defined  as,  d(a)  = max{d (a) | aeD^} . 

DEFINITION  2,5.  Let  a,  b,  b'eU  such  that  b = a*b',  then  b/a  = b'.  b/a  is 
not  defined  unless  a <_  b. 

DEFINITION  2.6.  Let  aeT^.  and  aeD  , a/a  = {(b,  x)  | (a*b,  x)ea},  a/a  is  the 
subtree  of  a at  a and  a/a  occurs  at  a in  a. 

DEFINITION  2.7.  Let  aeTj.,  aeU,  then  a*a  «*  {(b,  x)|(b/a,  x)ea}. 

DEFINITION  2.8.  Let  aeDa,  a,  SeTj,,  then  a(a-^6)  = ■[  (b , x)ea|  b£  a}  U a*6. 

This  is  the  result  of  replacing  the  subtree  a/a  at  a by  the  tree  6. 

n 

Using  postfix  notation,  the  tree  a = U i»a.  U{(0,  x) } is  represented  by 

i = l ' 

• • • Ct  X • 

n 


c: 
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DEFINITION  2.9.  t Is  a term  over  <E,  r>  Iff  t * xeE„  or  t = t.t_...t  x 
1 " ■”  0 I 2 n 

where  xeE^,  and  t.,  * £ I ^ n,  Isa  term,  T^.  Is  the  set  of  terms  over  E. 
There  is  obviously  a one-to-one  correspondence  between  the  terms  over  E 
and  the  trees  over  E.  If  the  corresponding  tree  of  t i s a subtree  of  a, 
we  say  t Is  a term  in  a. 

DEFINITION  2.10.  A regular  tree  grammar  over  <E,  r>  is  a regular  system 
G = (V,  r1,  P,  S)  satisfying  the  following  conditions: 

(1)  <V,  r*>  is  a f i n i te-ranked  alphabet  with  ECV,  and  r ' | E = r.  The 
element  of  V and  V-E  are  called  terminal  and  nonterminal  symbols, 
respectively. 

(2)  P is  a finite  set  of  productions  of  the  form  where  <f>  and  \p 

are  trees  over  <V,  r'>. 

(3)  S C Ty  is  a finite  set  of  axioms. 

DEFINITION  2.11.  Derivation  a%  is  in  C iff  <jr+\p  in  P such  that  a/a  = 

and  6 = a(a*-\Jj).  cf*>6  is  a derivation  iff  there  exist  a^.a^ , . . . ,0^,  m >_  0 

such  that  a = a„-Hx.  = g in  G^. 

U I m t 

DEFINITION  2.12.  The  language  generated  by  Gt  = (V,  r1,  P,  S)  over  <E,  r> 
is  defined  as,  L(G  ) = {aeTy|there  exist  xeS  such  that  x — > a}* 

DEFINITION  2.13.  A tree  grammar  G^  = (V,  r',  P,  S)  over  <E,  r>  is  expansive 
iff  each  production  in  P is  of  the  form 
x 

XQ  -*■  /\\  or  XQ  ■+■  x where  xeE. 


and  Xq,  Xj,...,Xr^j  are  nonterminal  symbols. 

For  our  convenience,  we  further  define  immediate  predecessor,  immediate  successor, 
and  adjacent  node  as  follows. 


"““,™  '',l' 


— 
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DEFINITION  2.14.  Let  a be  a tree  over  <E,  r>,  and  a.beD^,  a(a)  is  the  immediate 
predecessor  of  a(b)  iff  there  exists  i eN+  such  that  a«i  * b.  If  x is  the  immediate 
predecessor  of  y in  a,  then  y is  an  immediate  successor  of  x in  a. 

DEFINITION  2. 15.  Let  a be  a tree  over  <E,  r>  and  a,  beD^.  a(a)  is  right  adjacent 
to  a(b)  if  ci(a)  is  immediate  right  adjacent  to  ot(b)  in  postfix  notation  of  a. 

If  x is  right  adjacent  to  y in  a,  then  y is  :eft  adjacent  to  x in  a. 

Tree  automata  have  been  studied  and  defined  by  several  authors  [6,  8,  20,  25,  26], 
Following  the  definition  given  by  Brainers  [6],  a tree  automaton  is  a finite  automaton 
with  many-to-one  state  transition  functions. 

DEF IN ITI ON  2.16.  Let  <E,  r>  be  a stratified  alphabet  and  £ = ,a^. . . ,a^} . 

A finite  E-automaton  or  tree  automaton  over  E is  a system  M^  = (Q,  fj,...f^,R)  where 

(1)  Q is  a finite  set  of  states, 

(2)  for  each  i,  1 1 <_  k,  f is  a relation  on  Qr^°i^xQ, 


(3)  R C Q is  a set  of  final  states. 


r (o.) 


I f each  f . , 1 <_  i <.  k , is  a function  f . : Q i ' Q then  M^  is  deterministic; 

otherwise  M is  nondetermi ni st ic. 

The  construction  procedure  of  tree  automaton  for  a regular  tree  grammar  can  be 
summarized  as  follows  [14] . 

ALGORITHM  2.1.  Construction  of  tree  automaton. 

■ i 

Step  1 . To  obtain  an  expansive  tree  grammar  (V  , r,  P , S)  for  the  given 

regular  tree  grammar  (V,  r,  P,  S)  over  alphabet  E. 

Step  2.  The  equivalent  nondetermi ni st ic  tree  automaton  is  M = (V'-E,  f^,...,f^, 

{ S }) where  f.(X,,...,X  )~Xn  if  X.  -*■  X....X  x.  is  in  P'. 
ii  n u U I n i 

The  acceptance  of  a tree  by  tree  automaton  is  a backward  procedure.  It 
reads  parallel  branches  simultaneously  then  transfers  to  the  states  of  their  immediate 
predecessors. 


ku. 


---ifTir- iTitr-  a m t 


Mrii-ii  Ml  mnliru 


— I I«n  ilrnMWM.i 
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3 . Structure-Preserved  E rror-Correc t i ng  Tree  Automaton  (ECTA) 

Let  D be  a tree  domain,  DCTU,  I be  a set  of  terminal  symbols,  we  define 

T„°  * {ala  e T_,  D = D}  be  the  set  of  trees  in  the  tree  domain  D.  In  this  section, 

E 1 E a 

substitution  error  is  described  in  terms  of  the  transformation  S:  ->  J , For 

D Sa/x 

a e 0,  x e E and  a,  a*  e Tj-  , we  write  a| a'  if  a'  is  the  result  of  replacing  the 

k 

label  on  node  a of  tree  a by  terminal  symbol  x.  Furthermore,  S denotes  the  composition 
of  S with  itself  k times. 

The  distance  on  trees  in  Ty°,  p(a,  a'),  is  defined  as  the  smallest  integer  k for 

k 1 

S D 

which  a! a'  if  a and  a1  are  two  trees  in  T^.  for  some  D C U.  The  function  of  p 

is  symmetric  and  satisfies  triangle  inequality.  Let  L be  a tree  language,  and  tree 
a1  $ L.  The  essence  of  ECTA  is  to  search  for  a tree  a,  a e L such  that 

u(ci,  a')  = {p(6,  a')|8  e L,  Dg  = Da,}  (3-0 

and  reconstruct  a'  as  a.  p(a,  a')  in  the  above  equation  is  also  defined  as  the 
distance  of  a'  from  I , denoting  p (a1),  a is  called  the  minimum  distance  correction 
of  a'  in  L. 

3 . 1 Mini mum-d i stance  ECTA 

By  using  the  idea  of  adding  terminal  error  production  rules  corresponding  to 
substitution  error  transformations,  as  proposed  by  Aho  and  Peterson  [1],  the  covering 
grammar  G ^ , = (V1,  r',  P1,  S)  of  a given  tree  grammar  = (V,  r,  P,  S)  is  constructed 
as  follows: 

Step  1.  V'  = (V-E)  U E 1 , where  EO  E is  a new  set  of  terminal  symbols. 


Step  2.  For  each  y e £'  add  to  P' 


/\  •l,Xo  V\ 

I r (x)  I r (x) 


or  X^  -*•  y if  Xg-*x  is  in  P. 

The  language  generated  from  consists  of  the  language  L(G{)  and  its  corre- 
sponding erroneous  trees.  Hence,  L(Gt)  can  be  written  as 

L(g')  = {a'|a'  e Tr(,  and3a  e L(G  ) such  that  D , = D } 
t L t ot  ot 

Following  the  concept  of  tree  automaton,  the  ECTA  is  a backward  procedure  of 

constructing  a tree-like  transition  table.  Assume  that  a is  the  input  tree.  For 

each  node  a e D , there  is  a corresponding  set  of  triplets,  denoting  t in  the 

Ot  3 

transition  table.  Each  triplet  (X,£,k)  is  added  to  t if  X is  a candidate  state  of 

d 

node  a,  Z is  the  minimum  number  of  errors  in  subtree  a/a  when  node  a is  represented 
by  state  X,  and  k specifies  the  production  rule  used. 

For  a given  tree  grammar  G^  = (V,  r,  P,  S)  the  tree  automaton  that  accepts 
L(G^)  and  emits  a parse  that  consists  of  the  minimum  number  of  error  production  rules 
is  given  as  fol lows. 

ALGORITHM  3.1.  Minimum-distance  ECTA. 

Input:  G^  = (V,  r,  P,  S)  and  tree  a. 

Output:  Transition  table  of  a and  ^ (a). 

Method: 

(1)  If  r[a(a)]  = 0,  a(a)  = x,  then  add  to  tg 

(a)  (Xg,0,k)  if  Xg  -*■  x is  the  k^  rule  in  P. 

(b)  (Xg,l,k)  if  Xq  -*■  y is  the  kth  rule  in  P and  y t x. 

(2)  If  r[a(a)]  = n > 0,  a(a)  = x,  then  add  to  t 
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(a)  (X^.H.k),  if  Xg  -*■  is  the  kth  rule  in  P and 


0 A 


X....X 
1 n 


(X  ,£  ,k  ) e t ,,...,(X  ,£  ,k  ) e t then  9.  = P„.  + ...+  9. 

a • 1 ' ' n ' n ' n a . n I r 


(b)  (XQ,£,k)  , i f XQ  -*•  t\  is  the  kth  rule  in  P,  y t x,  and 


X,  ...X 

n 


(X.,?.,,k,)  e t .,..., (X  ,H  ,k  ) e t then  H = S.,  +...+  9.  + 1. 

Ill  a* 1 ’ n’nn  a«n  1 n 

(3)  Whenever  more  than  one  item  in  t has  the  same  State,  delete  the  item  with 
larger  number  of  errors. 

("*)  lf  A'-A  £ tQ,  then  (a)  _ | f no  ; tem  ;s  [ n Qf  the  form 

(S,£,k),  then  no  tree  in  L(Gt)  is  in  tree  domain  D^,  the  input  tree  is 
re  jected . 

The  parse  of  a can  easily  be  traced  out  from  the  transition  table. 

The  tree  grammar  in  the  following  example  is  part  of  the  highway  grammar  used 
in  Section  4.  In  the  meantime,  we  use  it  as  an  example  here  to  illustrate  the  operation 
of  structure-preserved  ECTA  and  to  demonstrate  the  highway  patterns  recognition  procedure 
that  will  be  discussed  in  Section  4. 

EXAMPLE  3.1.  Consider  a set  of  vertical  line  patterns  as  given  in  Fig.  3.J. 

Assume  that  elements  in  the  Ax1*  array  are  connected  as  a tree  shown  in  Fig.  3.2. 

Thus,  each  pattern  has  its  corresponding  tree  representations.  For  example,  pattern  (b) 
in  Fig.  3*1  can  be  represented  by  the  tree  shown  in  Fig.  3*3,  where  nodes  labeled  by 
symbol  "b"  represent  blank  elements  " D ",  and  nodes  labeled  by  "h"  represent  highway 
elements  " ^ ".  The  tree  grammar  that  generates  these  tree  representations  can  be 


written  as: 
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G = (V,  r,  P,  S)  over  <£,  r>  where 
H 

V = {S,  Ag,  Aj,  A^,  A^f  Xg,  Ij*  1 2 » 1 3> 

I = { * □ m > 

$ * b ’ h 

r($)  = 1,  r (b)  = {0,  1,  3),  r(h)  = {0,  1,  3) 

P:S*  * * * * 

| (1),  | (2),  I (3),  | (M 


to* 

(v 

9. 

5) 

(A,, 

3, 

8) 

<X0. 

5, 

14) 

(a2, 

5, 

9) 

(a3, 

9, 

10) 

t0*l 

•1  •' 

1 

<v 

2. 

6) 

"r 

1, 

11) 

<V 

o, 

15) 

02, 

1, 

12) 

lo 

.1  • 

<V 

3, 

6) 

( 1 1 ’ 

2, 

ID 

(X0, 

0, 

15) 

( 1 2 * 

1, 

12) 

" ■ 

1, 

13) 

0 * 1 * 2 * 1 


(Aq,  1,  6) 
(I,,  1,  ID 
(XQ,  1,  15) 
( I 2.  0.  12) 


lo 

• 1* 

2 

<V 

3, 

5) 

(A,, 

3, 

8) 

(Xq, 

<*, 

14) 

(A2, 

7, 

9) 

0* 1*2*2 
(Aq , 1,  5) 

(Xq,  1,  14) 
(A,,  3,  8) 


0*1  *3 

(Aq,  2,  6) 
(I,,  0,  11) 
(Xq,  1,  15) 

I 02,  2,  12) 

| 03,  2,  13) 


t0. 1 

■ 2 • ' 

3 

(Aq, 

1, 

6) 

( 1 1 ♦ 

o. 

11) 

(Xq, 

1. 

15) 

l** 

2, 

12) 

1 -2.2-2 
(Aq,  1,  7) 

(Xq,  0,  16) 

(I,.  1,  17) 


"0*  1 • 2 * 2 • 31  I 0-1  -2-3-1 
(A„.  1,  7)|  | (Aq,  1,  7) 


0- 1 - 3- 1 • 1 

(Aq,  1,  7) 

(xn,  0,  16) 


- " ■ • 1 " 1 _ _ 1 "V  m.m 
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Given  a noisy  pattern  and  its  tree  representation  a,  which  are  shown  in  Fig.  3.A(a) 
and  (b)  respectively,  the  recognition  of  a by  using  ECTA  is  shown  in  Fig.  3.5.  Since 
(S,  3,  2)  is  in  t^,  a is  accepted  by  the  ECTA,  and  the  number  of  errors  is  3.  The 
parse  that  generates  the  corresponding  correct  tree  of  a is  shown  in  Fig.  3.6. 

3.2  Max i mum- 1 ike  I i hood  ECTA 

When  the  probability  distribution  of  patterns  and/or  deformation  probabilities 
of  each  terminal  are  available,  error-correcting  parsing  based  on  max i mum- 1 ike  1 i hood 
criterion  become  a plausible  approach  in  handling  noisy  patterns.  Definitions  and 
properties  on  stochastic  grammar,  terminal  deformation  probabilities,  and  maximum- 
likelihood  error-correcting  parsers  have  been  introduced  in  [10,  11,  15,  1 8 , 23,  2**]. 

The  expansive  stochastic  tree  grammars  and  languages  defined  in  [A]  are  briefly 
reviewed. 

DEFINITION  3.1.  A stochastic  tree  grammar  Gs  = (V,  r,  P,  S)  over  Mj  is  expansive 

if  and  only  if  each  rule  in  P Is  of  the  form 

Xq  -£-> or  Xg  — > x where  x e E 

X....X  , . 

1 r (x) 

and  X. , X.  , . . . ,X  M t V-  E are  nonterminals. 

0 1 r (x) 

DEFINITION  3.2. 

k 

pi  r 

L (G_ ) = {(a,  p(a))|a  e T_,  S =>  a,  i = l***k,  p(a)  = £ P;) 

s h i = l 

where  k is  the  number  of  all  distinctly  different  derivation  of  a from  S,  and 

p.  is  the  probability  associated  with  the  i*^1  distinct  derivation  of  a from  S. 

Assume  that  the  occurrence  of  substitution  error  on  a terminal  is  independent 
from  its  neighboring  terminals.  Fung  and  Fu  [15]  define  substitution  error  as  a 
stochastic  mapping:  6:  E -*•  E such  that  6(a)  = b,  a,  b £ E,  with  probability  q(b|a), 

and  furthermore, 

I q (b  | a)  = 1 

beE 


I 
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In  our  application,  assume  that  t = t,**»t  x is  a term  over  <£,  r>.  We  have 

n 


6(t,***t  x)  ■ 6(t.)  •••  6(t  )6(x) 
in  I n 


A tree  t * t.***t  x is  said  to  be  locally  corrupted  under  6 mapping. 
I n 


If  two  trees  a = t,**»t  x,  a'  = t '•••t  1 x1  are  in  T D,  the  probability  of  a'  being 
In  In  I ’ 


the  noisy  deformed  tree  of  a i s 


q(a'|a)  = q ( 1 1 * 1 1 1 ) •••  q(tn'|tn)  q(x'|x) 


We  further  have 

J q(a'|oc)  = 1 for  any  a £ Tv°  , D C U 
a'eT^D 

For  a given  stochastic  grammar  G<.  = (V,  r,  P,  S)  over  <E,  r>,  when  the  deformation 
probabilities  q(y|x)  are  known  for  all  x £ £,  y e £',  the  stochastic  covering  grammar 
becomes  G^'  = (V‘,  r1,  P',  S)  over  <£',  r*>  where  V'  = (V-E)  U E1  and  £'  3.  £ is  the 


set  of  terminal  symbols,  and  for  all  y £ £'  X 


or  XQ  -c_> 


is  i n P 1 i f X, 


/\ 


y is  in  P 1 if  XQ  -E->  x is  in  P , 
is  in  P and  p'  = p q(y|x) . 


1 * * *Xr (x) 


Xr*,Xr(x) 


Apparently,  L(Gg,)  can  be  written  as  L 1 ) ■ {(a1,  p^a1))^1  £ T^., 

p' (a1)  = l q(a' | a)  p(a) } 

a£L(Gs) 

0 ,=D 

a'  a 

Suppose  that  the  given  noisy  input  tree  a1  is  in  tree  domain  D,  i.e.,  a'  E T^,  the 
max i mum- like  I i hood  error-correcting  decision  rule  in  this  case  is  to  choose  a tree  a 


IL 
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in  L(G<.)  of  domain  D,  i.e.,  a e L(G,.)  and  a C T^D,  such  that 


max 


q(o'|a)p(a)  = 6 e L(G$)nT  Oq(o'|B)p(B) 


(3.2) 


We  call  this  value,  q(u'|u)p(a),  the  probability  of  a'  being  a noise  deformed 
tree  of  L(G^)  and  denote  it  as  q (a* | L ) • 

The  structure-preserved  ma  . i mum- 1 i ke 1 i hood  ECTA  is  given  as  follows: 

ALGORITHM  3»2.  Maxi  mum- 1 i ke 1 i hood  ECTA 


Input:  (1)  Stochastic  grammar  G,.  = (V,  r,  P,  S). 


(2)  Deformation  probabilities  q(y|x)  for  all  x e E,  y e E'. 

(3)  Input  tree  a. 


Method:  (1)  If  r[u(a)]  = 0,  a(a)  = y then  add  to  t , (Xn,p',k),  if  X 

d U 


X I s 


the  k*^  rule  in  P and  p'  = p q(yjx). 


(2)  If  r[a(a)]  = n,  n > 0,  a(a)  = y then  add  to  t (X  ,p',k),  if 

3 U 


X — — > /X\  is  the  k1*1  rule  in  P and  (X.,p',k.)  e t . ... 


,,K, 


a*  1 


X,  ...X 
1 n 


(Xn,Pn,,kn)  G ta<n  then  p'  = P] **. . .*pn '*p*q (y | x) . 


(3)  Whenever  more  than  one  item  in  t have  the  same  state,  delete  the 

3 


item  associated  with  smaller  probability. 

(A)  If  (S,p',k)  e tQ,  then  q(a|L)  = p'.  If  no  item  in  t^  is  associated 


with  the  start  state  S,  then  no  tree  in  L(G<.)  is  in  tree  domain  D^. 


Input  tree  is  rejected. 

k.  Application  of  ECTA  to  Recognition  of  Highway  Patterns  from  LANDSAT  Data 


Recently,  syntactic  methods  have  been  used  to  analyze  and  interpret  data  obtained 
from  the  earth  resource  technology  satellite  (LANDSAT)  [7,  1 7] • The  input  data  used 
by  Brayer  and  Fu  or  Li  and  Fu  are  the  results  of  pointwise  classification  [27].  Each 
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pixel  collected  by  LANDSAT  represents  a ground  area  of  approximately  60  x 70  m . 
According  to  spectral  and/or  temporal  measurements  of  the  object,  a pixel  is  then 
classified  into  classes  of  water,  cloud,  downtown,  concrete,  or  grass,  etc.  Due  to 
the  resolution  size,  spectral  signals  of  smaller  objects  are  usually  composed  of 
reflectance  of  several  different  kinds  of  ground  cover.  For  Instance,  the  spectral 
signal  of  a segment  of  a highway  actually  results  from  a combined  reflectance  of 
concrete  surface,  grass,  and  transportation  vehicles.  Consequently,  the  variation 
of  size  of  smaller  objects  and  their  surroundings  chang  -s  their  reflectances,  and 
thus,  their  spectral  properties  from  point  to  point.  This  uncertainty  causes  some 
difficulty  in  setting  threshold  for  classification  based  on  spectral  information  of 
individual  points  only.  One  example  of  the  results  of  pointwise  classified  patterns 
is  given  in  Fig.  4.1  which  covers  the  area  of  the  northern  part  of  Grand  Rapids, 
Michigan.  Each  symbol  "H"  represents  a pixel  that  is  classified  as  a segment  of 


highway.  Fig.  4.2,  which  is  obtained  from  the  official  highway  map,  indicates  the 


major  divided  highways  of  the  same  area. 

As  illustrated  in  Fig.  4.1,  the  inadequate  resolution  of  highways  and  the  mass 
of  scattered  concrete  and  grass-mixed  objects  other  than  highways  result  in  dis- 
continuity of  highways  and  spurious  points  from  pointwise  classification.  Therefore, 
the  need  of  using  syntactic  methods  as  a refinement  becomes  evident.  Syntactic 
pattern  recognition  has  the  advantage  of  using  contextual  and  structural  information 
contained  in  patterns  for  recognition  purposes. 

Brayer  and  Fu  [7]  use  web  grammars  modeling  classes  of  clouds,  downtown,  highways, 
etc.  Due  to  the  lack  of  an  efficient  parsing  procedure  for  web  grammar,  a matching 
process  is  used  in  the  actual  recognition  of  patterns. 

In  [17],  Li  and  Fu  use  a tree  grammar  and  a tree  automaton  to  analyze  line 
patterns  such  as  highways  and  rivers.  It  demonstrates  fairly  good  results  in  recognizing 


Fig.  4.1.  Pointwise  Classified  Highway  Data  Obtained  from  Grand  Rapids,  Michigan 
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rivers  among  ponds  and  some  modern  buildings  with  glass  walls  having  similar 
reflecting  surfaces  as  water,  but  poor  in  analyzing  highway  patterns.  Evidently, 
highway  patterns  are  usually  too  noisy  to  be  effectively  analyzed  by  a conventional 
grammatical  inference  procedure  and  parsing  methods. 

In  [7]  and  [17],  pictures  are  scanned  window  by  window.  A window  is  an  array 
of  pixels.  Each  pixel  is  either  represented  by  symbol  "H"  or  is  a blank.  Some 
subpatterns  consisting  of  several  pixels  (2x2  in  [17]  and  5 x 5 in  [7])  are  selected 
as  primitives.  A syntactic  method  is  then  used  to  decide  whether  the  pattern  in  a 
window  is  a desired  pattern.  In  our  application,  we  choose  the  size  of  window  to 
be  an  8 x 8 array  of  pixels,  and  the  labels  on  single  pixels  to  be  primitives.  Thus, 
we  have  two  kinds  of  primitives:  "h"  represent  a segment  of  highway  and  "b"  represent 

nonhighway  area. 

Initially,  we  take  primary  sample  patterns  of  vertical,  horizontal,  or  diagonal 
line  as  our  positive  samples  (or  desired  patterns).  Assume  that  only  a single  segment 
of  highway  or  at  most  two  intersected  highways  that  appeared  within  a window  is  con- 
sidered. Hence,  some  patterns  consisting  of  two  combined  primary  sample  patterns  are 
also  added  to  the  set  of  positive  samples. 

Similar  to  the  procedure  illustrated  in  Example  4.1,  an  array  of  primitives  in  a 
window  are  connected  as  a tree.  Each  primitive  becomes  a labeled  node  in  the  tree 
representation.  We  fix  the  tree  domain  to  be  Du,  and  allow  node  label  to  be  either 
"h"  or  "b."  Hence,  there  are  2^  tree  representations  in  , where  £ = {b,h,$}, 

$ is  a start  terminal.  Apparently,  the  set  of  all  possible  patterns  in  an  8 x 8 window 

and  the  set  of  all  labeled  trees  in  the  tree  domain  D are  one-to-one  correspondence. 

H 

Fig.  4.3  illustrates  the  correspondence  between  points  in  the  8x8  array  and  nodes 

in  the  tree  domain  D , The  set  of  positive  sample  patterns  can  now  be  transformed 
H 

into  a set  of  positive  sample  trees  of  domain  Du.  The  tree  grammar,  G , that  is 

H H 
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inferred  to  generate  positive  sample  trees  is  given  in  Appendix  A.  Fig.  A.l  in 
Appendix  A illustrates  some  typical  patterns  whose  tree  representations  are  in 
L (Gu) . In  Fig.  A.l,  "Group  A"  denotes  the  group  of  patterns  that  are  generated 
from  rules  of  the  form 

S -*•  ^ , i = 1,...,8,  "Group  B"  are  from 

I 

A. 

i 

i 

S -*■  f etc. 

B. 

i 

Note  that  the  tree  whose  nodes  are  all  labeled  with  "b"  also  belongs  to  L ( G ), 

H 

Actually,  the  pattern  it  represented  is  an  undesired  one,  i.e.,  a window  with  no 
highway  passing  through.  It  is  our  negative  sample  tree,  denoting  as  X.  We  may 
categorize  the  sets  of  trees  that  have  been  introduced  so  far  as  follows:  T^H 

is  the  universe  we  are  working  on,  where  I = {$,h,b}.  $ is  a start  terminal. 

TV  Pi  (L(GU)  - {X}  is  the  positive  sample  set,  denoting  as  S . {X}  is  the  negative 
L " D 

sample  set  denoting  as  S . Let  the  set  T_  - (S  US  ) , be  denoted  as  N,  then  N = 

T..^H  L(G^).  Apparently,  elements  in  N are  noisy  patterns,  which  are  normally  mis- 

recognized  by  a conventional  parsing  scheme.  The  idea  of  using  error-correcting  tree 
automaton  is  to  measure  the  distance  between  an  input  pattern  and  patterns  in  S+US  . 

If  the  input  pattern  is  in  N,  it  will  further  be  reconstructed  to  its  best  matching 
pattern  in  S+  in  terms  of  the  minimum  number  of  mislabeled  nodes,  or  erased  if  its 
best  matching  pattern  is  in  S . Since  S is  also  in  L(Gh),  we  may  measure  the  distance 
of  the  input  tree  with  S+  and  S at  the  same  time. 

Assume  that  an  input  picture  is  scanned  column  by  column  from  left  to  right. 

After  an  8 x 8 array  of  primitives  has  been  processed  by  ECTA,  we  erase  the  top  left 
5x5  array  of  pixels,  i.e.,  replace  "H"  points  by  blanks  of  the  original  pattern  and 
superimpose  the  reconstructed  pattern  on  the  original  window.  The  scanning  window 
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then  moves  down  five  rows  of  pixels.  The  pattern  that  appeared  in  the  window  is 
processed  by  the  same  procedure.  After  eight  columns  of  pixels  have  been  scanned, 
the  process  is  repeated  from  the  sixth  column.  By  doing  this,  the  error-correcting 
scheme  would  be  furnished  with  contextual  information,  not  only  within  the  window, 
but  between  the  window  and  its  sur round i ngs . Using  the  pattern  shown  in  Col.  141-148, 
row  1 — 1 8 of  Fig.  4.1,  the  window-by-window  correcting  process  is  illustrated  in  Fig.  4 
The  flowchart  of  the  entire  recognition  procedure  is  given  in  Fig.  4.5. 

This  error-correcting  scheme  of  the  highway  recognition  problem  is  programmed  in 
Fortran  TV  on  CDC  6500  computer  and  tested  using  the  data  shown  in  Fig.  4.1.  The 
result  is  shown  in  Fig.  4.6.  There  are  80  x 160  pixels  in  the  input  data.  The  cpu 
time  for  processing  is  150  sec. 

We  also  use  the  grammar  G , which  is  trained  from  Grand  Rapids  data  to  analyze 
some  other  noisy  data,  such  as  data  obtained  from  Lafayette,  Indiana.  The  pointwise 
classified  data  of  Lafayette  is  shown  in  Fig.  4.7  which  contains  125  x 125  pixels. 

The  result  of  the  error-correcting  analysis  is  shown  in  Fig.  4.8.  The  cpu  time  used 
Is  101  sec.  For  comparison,  we  use  the  highway  map  shown  in  Fig.  4.9  as  ground  truth. 

There  are  several  remarks  to  be  made  about  this  highway  recognition  example. 

Originally,  the  presumed  positive  samples  were  more  than  those  generated  from 
G in  Appendix  A.  Marty  different  patterns  of  two  intersected  highways  were  considered 
After  the  originally  inferred  grammar  was  tested  by  Grand  Rapids  data.  We  further 
removed  production  rules  that  were  infrequently  (or  not)  accessed  and  reduce  the 
highway  grammar  to  Gu  in  Appendix  A. 

The  grammatical  inference  procedure,  when  using  the  error-correcting  parser,  is 
a little  different  from  the  regular  procedure  [13].  With  a presumed  and  roughly- 
sketched  pattern  as  positive  samples,  the  language  generated  from  the  inferred  grammar 
must  be  as  close  to  the  set  of  positive  samples  as  possible.  Otherwise,  there  is 
always  the  possibility  that  the  noisy  pattern  would  find  its  best  match  in  the  set 
of  illegitimate  sentences  (sentences  in  L (G ) but  not  in  S+US  ). 


.... 
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The  alternative  solution,  if  not  using  the  error-correcting  scheme,  of  course, 
is  to  obtain  a sufficient  set  of  positive  samples  and  a set  of  negative  samples  from 
training  data,  and  to  infer  a grammar  so  that  all  the  negative  samples  are  excluded  from 
the  language  and  all  the  positive  samples  are  Included.  It  is  difficult  to  infer  such 
a grammar  when  patterns  are  irregular.  Besides,  due  to  the  large  variation  of  input 
data,  a slight  difference  in  the  training  set  will  cause  some  patterns  to  be  rejected 
during  parsing. 

For  our  convenience,  we  use  fixed  tree  domain.  A deleted  segment  of  highway  is 
taken  as  substitution  of  a highway  primitive  by  a blank  primitive.  A spurious  H 
point  is  considered  as  substitution  of  a blank  primitive  by  a highway  primitive.  A 
generalized  ECTA  which  has  no  restriction  on  using  a fixed-tree  structure  is  introduced 
in  the  following  sections. 

5.  Distance  on  Trees 

The  basic  approach  to  tree  distance  measurement  is  to  formalize  a concept  of 
transformations  of  a tree,  and  to  define  a metric  on  trees  as  the  least  number  of 
transformations  necessary  to  transform  one  given  tree  into  the  other.  This  approach 
has  the  advantage  of  accommodating  itself  to  linguistic  notion.  It  is  also  closely 
related  to  the  natural  way  of  imposing  a metric  on  the  vertices  of  graph. 

In  Section  3,  since  only  substitution  transformations  are  considered,  tree 
distance  is  measurable  when  two  trees  are  in  the  same  domain.  Our  purpose  in 
this  section  is  to  define  transformations  between  trees  in  different  tree  domains 
such  that  tree  distance  is  measurable  for  any  two  given  trees.  Consequently,  a 
syntax  analyzer  that  accommodates  the  transformations  we  defined  will  be  easily 
constructed. 

Extending  the  three  types  of  syntax  errors  defined  on  strings,  namely, 
substitution,  insertion,  and  deletion,  we  define  errors  on  trees  to  be  the  following 


five  operations: 
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(1)  the  substitution  of  the  label  of  a node  by  another  terminal  symbol, 

(2)  the  insertion  of  an  extraneous  labeled  node  between  a node  and  its  immediate 
predecessor , 

(3)  the  insertion  of  an  extraneous  labeled  node  to  the  left  of  all  the  immediate 
successors  of  a node, 

(A)  the  insertion  of  an  extraneous  labeled  node  to  the  right  of  a node, 

(5)  the  deletion  of  a node  of  rank  1 or  0. 

The  three  operations  of  insertions  in  rule  (2),  (3)  and  (M  are  named  as 
stretch,  branch,  and  split,  respectively,  according  to  the  relative  position 
of  the  inserted  node  to  the  original  tree.  Apparently,  the  inverse  operation  of 
any  type  of  insertion  is  deletion,  and  the  inverse  of  deletion  operation  is  one 
of  the  three  types  of  insertion. 

We  define  five  types  of  transformation  S,  T,  P,  B,  D,  from  to  the  subsets 
to  describe  substitution,  stretch,  split,  branch,  and  deletion  errors 
respectively.  Let  a be  a tree  over  <Z,  r>,  aeU,  yeZ,  and  c»i  = a,  r[a(c)]  = n then 
error  transformations  are  defined  as  follows: 


(1) 

Wa> 

= a(eH-{  (0,y)  }U{  i«ct/a»  i | 1 < 

i _<  r 

[a(a) ] }) , 

(2) 

Ta/y(“> 

= ct(a-»-{  (0,y)}  U{l-a/a}), 

(3) 

Ba/y(a) 

“ a(a-»-{  (0,ci(a) ) , (1  ,y)  }U{  i + 1 • 

a/a»  i | 

1 £ ' 

< 

r[a(a)]}) , 

W 

Pa/y(c,)- 

itt(c*n+l  *■  a/c*n)  • • • (c*  i + 1 + 

a/c* i ) 

1 (a  +■ 

(0, 

, y))j 

(5) 

Da/a(a) 

1 \ fa(c*i  a/c«  i + 1 ) • • • (1 

W = | a(c  +■  a/a) 

:.n-l  *■  a/c*n 

) 

if  r [a(a) ] 
if  rfa(a)] 

= 0 
= 1 

S,  T,  B, 

P and  D transformations  are 

i 1 1 us ' 

tra ted 

i n 

Fig.  5.1 

(a) , 

(b),  (c)  , 

(d) 

and  (e). 

respectively. 

We  wr • te 

a| — - — 6,  if  6 is  in  A (a). 

where 

Ae{S, 

T 

. B,  P,  D}, 

and 

further 

denote  that  a|-^-  B for  k > 0 if  8 is  derived  from  a by  applying  k transformations 

where  Ak  denotes  the  composition  of  A with  itself  k times.  The  distance  on  trees 

...  1 Ak  B,  if  a and  B 

over  I,  u(d,B),  is  defined  as  the  smallest  integer  k for  which  a] 

are  two  trees  in  T^,.  The  function  y is  symmetric  and  satisfies  triangle  inequality. 
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For  example,  suppose  a = {(0,$),  (I,-),  (}•!,  P) , (2,~),  (2 - ? . q) } . Then  given  the 
tree  B = { (0 , $ ) , (l,v),  (1*1,  P)  , (1»2,  q),  (2,  q)},  we  can  say  that  u(a,B)  = 3, 
because  B = (P j # j / (S ^ ^ (a) ))  and  no  other  derivation  of  8 from  u costs  trans- 

formations less  than  three,  a and  B are  shown  in  Fig.  5.2. 

Let  L be  a tree  language,  a tree  8 not  in  L can  be  derived  from  some  tree  in 
L by  a sequence  of  error  transformat  ions.  The  distance  of  8 from  L is  defined  as, 

UL (8)  = min{p (a, 8) |ae  L}. 
a 


When  a tree  B is  inserted  between  subtrees  with  roots  at  a^  and  a^  of  tree  a, 
respectively,  the  root  of  8 is  said  to  be  caused  by  a split  error  of  a(a^),  and 
the  rest  nodes  in  8 are  inserted  by  sequence  of  branch  and/or  split  operations. 

Similarly,  when  B is  inserted  to  the  left  of  all  the  immediate  successor  of  node 
a of  a,  then  the  root  of  8 is  said  to  be  caused  by  a branch  error  of  a(a) . If  a/a 
matches  subtree  B/b  of  8,  bj*0,  i.e.,  a/a  = 8/b,  then  the  immediate  predecessor  of 
b is  said  to  be  caused  by  a stretch  error  of  a(a)  followed  by  a sequence  of  stretch, 
branch,  and/or  split  operations.  When  deletions  cause  a branch  of  the  resulting  tree 
to  be  a subtree  of  the  corresponding  branch  of  the  original  tree,  nodes  at  the  lowest 
level  will  fir*. I lie  deleted,  then  followed  by  llieir  predecessors. 

Since  each  grammar  rule  eventually  generates  a branch,  the  tree  metric  is  related 

* 

to  linguistic  model  in  such  a way  that,  if  a/a  is  assumed  to  be  ¥,  and  X —*»>  'f,  where 

Gt 

is  the  given  tree  grammar,  then  a/a  is  represented  by  X^.  This  procedure  can  actually 
be  considered  as  pruning,  although  a/a  and  ¥ may  be  different. 

i 
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EXAMPLE  5.1  Measure  the  distance  between  acyclic  directed  graphs  by 
the  proposed  tree  metric. 

Assume  a directed  graph  of  labeled  vertice  and  unlabeled  branches  as  shown  in 
Fig.  5.3,  the  tree  grammar  inferred  to  generate  such  patterns  is  given  as  follows. 
Gt  = (V.  r,  P,  S) 

V *=  { S , A,  B,  C,  D,  E,  H,  $,  a,  b,  c,  d,  e,  h} 

r($)  = {1},  r (a)  = {0,  3),  r(b)  = {0,  3,  4},  r(c)  = {0,  2} 

r(d)  - {0,  2,  3),  r (e)  = {1,  2},  r(h)  = {1,  2} 


7I\  . 


H D B 


B7/\\ 

C H A E 


D D 

/|\  • /l\ 


CHE 


H A E 


7\  • 

B D 


“7l\ 

A E C 


/\  • 
E C 


d 

/\  • 
A C 


V\  • 

A C 


v\ 

C D 


Suppose  that  a is  the  given  distorted  pattern.  Following  the  grammatical 
rules,  graphs  after  each  tree  error  transformation  used  are  illustrated  In  Fig,  5.4. 


L 


i 
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6.  Generalized  Error-Correcting  Tree  Automata  (GECTA) 

As  discussed  in  Section  3,  for  any  given  a'  not  in  language  L,  the  essence 
of  an  error-correcting  tree  automaton  is  to  search  for  a in  L such  that  the  distance 
of  a from  a'  is  the  smallest  among  all  the  sentences  in  L.  That  is, 

u(a,a')=  ^ v* ( B , ot 1 ) (6.1) 


Note  that  the  restriction  D = 0 , in  eq.  (3.1)  has  been  removed  here. 

a or 

A search  algorithm  for  the  best  match  solution  based  on  grammar  rules  and  error 
transformat  ions  defined  in  Section  5 will  be  studied  in  this  section.  We  called  it 
generalized  error-correcting  tree  automaton  (GECTA)  to  distinguish  from  the  ECTA  of 
Section  3*  Before  introducing  the  GECTA,  a normal  form  of  trees  and  tree  grammars, 
called  the  binary  form,  is  defined. 

6. 1 T ree  Binary  Form 

We  first  define  a binary  tree  grammar  as  follows: 

DEFINITION  6.1.  A tree  grammar  Gb  = (Vb>  rb,  ?h,  S)  over  <Ib>  rb>  is  said  to  be 
in  binary  form  if, 

(1)  A pseudo  symbol  * is  in  £b« 

(2)  rb  = (0,  1,  2} 

(3)  Each  production  in  Pb  is  in  one  of  the  following  forms: 

(a)  U,  X1  *. 

(b)  U2  - U,X,  * 

(c)  XQ  -r  U]X1  x 


(d)  XQ  - Xj  x 

(e)  XQ  -*■  x 


where  Up  Up  XQ,  X1  are  in  Vb  - 2^,  xe  Es  • {*}. 


f 


1<0 


Let  a be  a tree  in  T,.,  and  * be  a pseudo  symbol  not  in  E,  the  conversion  of 
tree  a into  its  binary  form  a is  given  in  the  following  Algorithm. 

ALGORITHM  6. 1 . Binary  Form  Conversion. 

Input:  Tree  a in  T,.. 

Output:  a , binary  form  of  a. 

Method:  Repeat  CONVERT  until  a is  in  binary  form. 

<C0NVERT>:  (1)  If  tp  * x is  a term  in  a,  then  tg  is  said  to  be  in  binary  form. 

(2)  If  t. t are  already  in  binary  form,  and  = (t....t  ) x 

In  l)  i n 

is  a term  in  a,  x e I then  t„  is  said  to  be  in  binary  form 
’ n 0 

i f n = 1 , then  tQ  = (t j ) x 

£ 

If  n > 1,  then  define  t,  . such  that 

1 l n-1 

= (t,)* 

* * 

t = ( t t ) * 

z2  KZ\  zr 


- <*„-2  V, 


)* 


The  construction  of  the  binary  tree  grammar  = (V^,  P^»  r^,  S)  from  a given 

tree  grammar  G^  = (V,  r,  P,  S)  is  as  follows: 

Let  E.  = EU{*},  add  V to  V.  . P.  is  constructed  as  follows: 
b b b 

Step  1 . For  each  production  in  P of  the  form  Xg  -*■  x,  xeEg,  add  X t0 

P,  and  r,  (x)  =0, 
b b 

Step  2«  For  each  production  in  P of  the  form  X^x,  xe£j,  add  X^  -+  X^x 

to  P.  and  r.  (x)  = 1, 
b b 


i- 


r 


— ... 


k\ 


Step  3*  For  each  production  in  P of  the  form  + X....X  x,  xe£  , n > 2f  add . 

0 I n n — 

U0l  * V 
“o2  * U01 V 


U0n-1  - U0n-2Xn-l* 

X0"U0n-lV 

to  Pb,  and  rb(*)  = {1,  2},  rb(x)  = 2.  Add  {UQi|l  <_  I <_  n-1}  to  Vfc. 

Let  the  binary  form  of  a tree  language  LfG^)  be  denoted  as  Lb(Gt)  an^  ^b  be  the 
binary  form  of  grammar  G^,  then  L^G^)  = L (G^) . 

EXAMPLE  6.1.  Character  £ 

Using  tree  representation  of  character  E,  a,  and  its  inferred  tree  grammar  G^, 
the  binary  form  of  a and  G^,  namely,  a*  and  Gb>  are  illustrated.  Pattern  primitives, 
pattern  of  character  E,  and  its  corresponding  tree  representation  a,  are  shown  in 
Fig.  6.1  and  6.2(a),  (b),  respectively.  The  tree  grammar  G^  which  generates  character 
E of  different  sizes  is 

Gf  = (V,  r,  P,  S)  over  <Z,  r> 

V = {S,  B,  C,  D,  $,  b,  d>  Z = {$,  b,  d} 
r($)  = 2,  r (b)  = (l,  2},  r(d)  = (0,  1} 


P: 


A 


V\ 

C D 


b 

I 

B 


A2 


The  converted  binary  form  of  Gf  is  as  follows: 

Gb  = (V  V Pb*  S)  OVer  <rb*  rb> 


Vb  = {s,  u$,  B,  B,  D,  C,  b,  d,  *},  Eb  = eU  {*} 
rb($)  = {2},  rb (b)  = {1,  2}f  rb(d)  = {0,  1},  rfa(*) 


US  D 


A ' 


The  binary  form  of  a,  a*,  is  shown  in  Fig.  6.3. 


6.2  Minimum-Distance  GECTA 


Let  Mb  = (Vb“Eb,  F.  {S})  be  a tree  automaton  that  accepts  a tree  grammar  in 
binary  form  G,  = (V,  , r,  , P,  , S)  over  E.  , where  F is  the  set  of  transition  functions 

D D D D D 

r (x) 

on  (Vb“Eb)  'x(Vb-Eb),  Por  a1'  x C ^b*  adding  error  transitions  according  to 
the  transformations  defined  in  Section  5*  the  expanded  tree  automaton  that  accept 
all  the  possible  erroneous  trees  is  constructed  as  follows: 


Fig.  6.1.  Pattern  primitives 


_ d 
„ d 


$ 


d 

(b)  a 


Fig.  6.2.  Character  E and  its  tree  representation 


Fig.  6.3.  a*.  Binary  form  of  a 


ALGORITHM  6.2.  Expanded  Tree  Automaton  M^. 


Step  1.  Mb  = (V.  F,  {S})  where  V = (vb“Zb)u{ * > . F = F U FS  U F°  U F 1 , FS , F° , F1  are 

called  substitution,  deletion,  insertion  error  transitions,  respectively. 

Step  2.  FS  ={fy(O^X0,  °lFx(O^x0  e F,  x * *,  y c ^-{x}} 

Step  3.  F°  = {f  ( C)  ~XQ,  v|Fx(r.K  XQ  G F,  x ft  *}U 
{fUKXQ,  0 1 F A (O^XQ  G F} 

Step  A_. 


(a) 

Add 

W'V  {' 

, to  F1 

for  all  y c Ib"{*}  i 

f f^(  o % XQ  is  in 

F and  x t *. 

(b) 

Add 

f„0,  x,)'x0. 

, 0,  or 

fx(lKX0,  0,  to  F1 

if  f^(X j ) % Xq  or 

f 'v 

X 

Xq  is  in  F,  X 

c Ib. 

(c) 

Add 

f*(  C)  ^XQ,  0, 

f*<V 

IHXq,  0,  and  fy(XQ, 

O'vXp,  a to  F*  if 

fx(£)  is  in  F 

for  a 1 1 

x e Z,  • 

D 

(d)  Add  to  F*,  fy(l)M,  6,  and  f (I,  0^1.  6,  for  all  y z Zb  and  f M , 6,  for 
all  y G £b-{*} 

Where  £ G , i = 1,  2. 

o,  v,  and  6 of  error  transitions  represent  costs  of  substitution,  deletion,  and 
insertion  errors,  respectively.  Hence,  weighted  distance  can  be  measured  by  using 
expanded  tree  automaton.  (a),  (b) , and  (c)  in  Step  introduce  stretch,  branch, 
and  split  operations,  respectively. 

The  search  algorithm  for  the  least  cost  (minimum-distance)  solution  is  to 

construct  a tree-like  transition  table  with  all  candidate  state  and  their  corresponding 

costs  recorded.  Each  element  in  the  transition  table  corresponds  to  a node  in  the 

tree  domain  of  the  input  tree.  Let  it  be  denoted  as  t if  a e D ,,  and  a'  be  the 

a a' 

input  tree,  then  a pair  (X,  n)  is  in  t if  X is  a candidate  state  representing  a'/a, 

D is  the  minimum  cost  associated  with  X,  the  algorithm  is  given  as  follows: 


ALGORITHM  6.3.  Minimum-Distance  GECTA 


Input:  Expanded  Tree  Automaton  M^  = (V,  F,  {S})  of  a tree  language  L (G ^ ) , and 

i nput  tree  ex' . 


Output : 

^L(Gt)(a')  and  transition  table  of  a'. 

Method: 

<SUBTREE(Start)> 

(a) 

SUBTREE (Start)  = {(XQ,  v)|f*X0#  v e FD}. 

(b) 

Add  (X  , n)  to  SUBTREE  (Start)  if  f(X))'bXf)>  v c F°,  (X]f  n) 

G SUBTREE (Start) , 

then  r)  = v + tt. 

(c) 

Add  (X  , n)  to  SUBTREE (Start)  if  f(X,,  X^Xg,  V e FD,  (XJt 
e SUBTREE (Start) , then  n = v + Hj  + 

tt,),  (X2,  tt2) 

(d) 

Whenever  two  or  more  items  associated  with  the  same  state. 

Delete 

items  with  higher  costs. 

<SUBTREE (X , 0)> 

(a) 

SUBTREE (X,  0)  = { (X  , n)|f(XKX0,  v e F°,  n = v + 6} 

(b) 

U((x,  0)1. 

Add  (X  , n)  to  SUBTREE (X,  0)  if  f(Y)%XQ>  v e F°,  (Y,  n)  e SUBTREE  (X , 0), 

n = v + TT. 

(c) 

Add  (X  , n)  to  SUBTREE  (X,  0)  if  f(Y,,  Y^Xg,  V G FD,  (Y]f 

IT,)  G 

SUBTREE  (Start ) and  (Y2>  tt2)  g SUBTREE  (Start) , then  n =*  V + 

TT1  + 7T2. 

(d) 

Delete  redundant  states. 

ii«R. 

1_.  If  r[ot(a)]  = 0 a(a)  = x,  then 

(a) 

(XQ,  n)  is  in  t if  ^(X^Xq,  X e FUFS  U F1  , for  all  (X,  tr) 
SUBTREE  (Start) , n = X+tt. 

i n 

(b) 

(X  , n)  is  added  to  t if  f (X.,  X )^X  , X g FUFS  U F1,  (X., 

0 3 X 1 / U 

(X2>  tt2)  g SUBTREE  (Start) , n = X + ir^  + ir2. 

tt^)  and 

(c) 

Delete  redundant  states. 

b 6 


Step  2.  If  r[a(a)]  = 1,  a(a)  = x,  then 

(a)  (X.,  n)  is  in  t if  f (Y)n,x  X e FUFS  U F'.for  all  (Y,  it)  e 

U 0 X u 

SUBTREE  (X,  6)  and  (X,  9)  e t . , n = X + tt. 

0 • I 

(b)  (XQ,  n)  is  added  to  tg  if  f ^ ( Y ^ , Y^'vXq,  X e FUFS  U F'.for  all 
(Y | , tt j ) e SUBTREE (X,  0)  and  (Y2,  tt2)  e SUBTREE (Sta rt ) or 

(Y  j , TTj)  e SUBTREE  (Start)  and  (Y2,  tt2)  e SUBTREE(X,  6)  where  (X,  0)  e ta>] 

then  n = X + TTj  + tt2« 

(c)  Delete  redundant  states. 

Step  3.  If  r[a(a)]  = 2,  a(a)  = x,  then 

(a)  (XQ,  n)  is  in  t if  f (YJP  Y^Xq,  X £ FUFS  U F1  for  all  (Y]  , ir| ) C 

SUBTREE  (X,,  0.)  and  (Y„,  tt0)  e SUBTREE^  0.)  where  (X,,  0.)  t , 

II  L L 2 1 I I 0 • I 

and  (X2>  02)  e tg  2,  then  n = X + tt^  + 

(b)  When  x t *,  add  (X^,  n)  to  tg  if  f_,.(Yj,  Y2)%Z,  0 E FUF1  and  f^Zj,  Z^^X^, 

X e FUFS,  (Z.,  tt  ) E SUBTREE  (Z,  0),  (Z2>  tt2)  e SUBTREE  (Start) , then 

n = X + TT  + TT2. 

(c)  Delete  redundant  states. 

For  all  states  in  M.  , a table  of  minimal  deleted  tree  is  first  computed  by  the 
b 

two  subroutine  SUBTREE (Start)  and  SUBTREE(X,  0).  (X^,  n)  is  generated  from  subroutine 

SUBTREE  (Start) , if  X„  =^=>w , and  w is  the  smallest  subtree  among  all  the  possible 

0 Gb 

der i vat  ions s ta rt i ng  from  nonterminal  X^.  By  referring  to  the  smallest  subtree,  we 

mean  the  number  of  nodes  in  the  subtree  is  the  least.  Similarly,  (X^,  n)  is  generated 

J. 

from  subroutine  SUBTREE(X,  9)  if  X.  ====>  41,  and  if  all  the  nodes  in  V are  terminal 

0 G, 

b 

nodes  except  one  frontier,  it  is  labeled  with  nonterminal  X^.  Furthermore,  V has  the 
least  number  of  nodes  among  all  the  possible  derivations.  Steps  (1),  (2),  and  (3)  are 
formulated  under  the  assumption  that:  (i)  the  trees  represented  by  the  states  of  the 
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immediate  successor  of  the  current  node  may  be  the  subtree  of  the  actual  derivation, 

(ii)  there  are  one  or  more  subtrees  that  may  be  deleted  between  the  current  node  and 

x 

the  node  right  adjacent  to  it.  For  example,  if  rule  X.  | is  applied  at  t , but 

u Y a 

(X,  0)  e t .,  then  (Y,  n)  must  be  in  SUBTREE (X,  0).  Let  B be  the  tree  represented 
a • l 

by  state  X and  a be  the  tree  represented  by  state  Y,  then  B is  a subtree  of  a,  odes 
in  !j  but  not  in  6,  are  assumed  to  be  deleted.  The  cost  of  deletion  is  tt-0  which  is 
minimum  among  all  the  possible  deletions.  All  the  insertions  are  handled  by  transitions 
i n F automat  teal  1 y . 

EXAMPLE  6.2.  Recogn i t ion  of  Hand-Wri tten  Character  E_ 

Suppose  a casually-written  character  E,  as  shown  in  Fig.  6.4(a),  whose  binary 
tree  representation  a1*  is  shown  in  Fig.  6.4(b),  is  to  be  recognized  automatically* 

The  expanded  tree  automaton  constructed  from  the  tree  grammar  G^.  in  Appendix  B- 1 is 
as  follows: 


Me  = (V,  F,  {S}) 


V = (S,  Uc,  B,  U , D,  C,  I)  f.  = {a,  b,  d,  h,  $,  *} 
5 B 

F: 

F:  fj  (Us,  D)  a,  S,  0 

f*  (B)  a.  Us,  0 
fb  (UB,  D)  a.  B,  0 
f*  (C)  A,  UB,  0 
fd  A.  D,  ° 

fb  (0)  A,  c,  0 


F 


S 


fy  (UB,  D)  A,  B,  o 
f A,  D,  o 

y 

f (D)  A,  c,  o 


y e {a ,d,h} 
y e {a,b,h} 
y c {a,d, h) 


*♦9 


By 

in  Fig. 
ML(Gt)  ( 


fb  (C.  I)  ~ C,  0 

f (c,  I)  * c,  a 

fy  (I)  ^ I,  6 

fy  (I,  l)  ^ l,  <5 

f 'v  1 , 6 

y 


$ 


(a) 


y G {a,d,h} 
y G (a , b ,d , h ,*} 
y c (a,b,d,h,*) 
y G {a,b,d,h} 


$ 


b 


d 

(b) 


Fig.  6.1*.  Distorted  Character  E and  its  Binary  Tree 
Representation  a1* 

using  GECTA,  the  computation  of  the  transition  table  of  a1*  is  illustrated 
6.5(a)  and  (b) . From  Fig.  6.5(b),  since  (S,  a + S + v)  is  in  tQ,  we  have 
')  = cr  + <5  + v. 


I 

I 


Start 

4v 

2v 

2v 

V 

us 

0 

- 

- 

- 

- 

B 

0 

0 

- 

- 

- 

UB 

2v 

0 

0 

- 

- 

C 

2v 

2v 

0 

0 

- 

D 

3v 

3v 

V 

V 

0 

(Y,  it)  e SUBTREE(X,  0) 

(a)  Table  of  Minimal  Deleted  Trees 


I 


Fig.  6.5.  The  Acceptance  of  a'*  by  GECTA 
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7.  Application  of  Error-Correcting  Parsing  Schemes  to  Pattern  Recognition 

Using  the  error-correcting  syntax  analyzer  in  solving  the  problem  of  noisy  and/or 
distorted  patterns  classification  and/or  reconstruction  has  recently  been  suggested  [23]- 
So  far,  little  has  been  done  on  an  overall  evaluation  of  this  approach.  The  classifi- 
cation and  reconstruction  capability  of  some  error-correcting  parsers  have  been 
illustrated  in  [ 1 8 ] and  Section  A of  this  report.  We  may  conclude  that  high  accuracy 
and  low  efficiency  are  the  characteristics  of  error-correcting  parsers.  The  questions 
are:  Is  there  a necessity  for  using  an  error-correcting  parser?  What  is  the  best  role 

an  error-correcting  parser  should  play  in  a syntactic  pattern  recognition  system? 

An  example  of  recognition  of  hand-printed  characters  by  using  GECTA  is  given  in 
Section  7.1  where  the  operation  of  GECTA  is  illustrated.  The  merits  and  weaknesses 
existing  in  general  error-correcting  parsing  schemes  can  also  be  shown  by  this  example. 

7.1  An_  I 1 1 ustrat i ve  Example  - Hand-Pr i nted  Character  Recogn i t ion 
A classic  example  in  the  field  of  pattern  recognition,  the  hand-printed  character 
recognition  problem,  is  given  in  this  section  as  an  illustration.  A thorough  survey 

Ion  the  subject  of  character  recognition  has  been  given  by  Munson  [3M.  As  Munson 

pointed  out,  the  great  spatial  variability  of  hand-printed  characters  has  led  research 
workers  to  explore  more  sophisticated  preprocessing  and/or  feature  extraction  methods. 
Nearly  a 30%  correct  classification  rate  on  single-coder  file  and  80%  on  multiple- 
coder  file  are  reported  in  [3^1  where  a linear  classifier  is  used.  Syntactic  de- 
scriptions of  hand-printed  characters  have  been  proposed  recently  [31 » 35,  36,  37]. 

In  [35,  36],  shape  and/or  orientation  of  line  segments,  as  well  as  spatial  relation 
between  lines,  are  coded  and  then  composed  into  syntax  rules.  Shaw  [37]  uses  variant 
size  and  direction  of  line  segments  as  pattern  primitives.  Concatenate  relations  between 
primitives  are  described  by  PDL.  Instead  of  analyzing  the  skeletons  of  patterns,  Fena  anr 
Pavlidis  [31]  consider  character  patterns  as  polygons.  In  their  approach,  pattern 
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primitives  are  the  simpler  components  obtained  from  decomposition  of  polygons.  During 
decomposition,  juxtaposition  relation  results  in  a labeled  graph.  The  resulting  de- 
scriptions are  generally  size  and  rotation  invariant;  thus,  error  tolerable  to  some 
extent.  The  decomposition  algorithm,  which  is  implemented  in  Fortran  on  an  IBM  360/91 
computer,  costs  approximately  A sec.  to  process  a single  Chinese  character.  Stallings  138] 
encoded  components  of  Chinese  character  by  using  contour  tracing  technique  and  Freeman's 
chain  code.  Information  about  direction  and  concatenation  of  strokes  are  contained  in 
the  resulting  code  strings.  For  recognition  accomplished  by  using  a decision  tree 
search  method,  it  is  reported  by  Narasimhan  and  Reddy  [36]  that  the  average  processing 
time  per  character  is  6 seconds  on  a CDC  3600  computer. 

In  this  section,  input  characters  are  assumed  to  be  digitized  line  patterns  in 
a 16  x 16  format.  After  chain  coding  [32]  and  primitive  extracting,  input  patterns 
are  transformed  into  their  corresponding  tree  representations.  A set  of  grammars 
for  sample  characters  are  given;  hence,  a GECTA  is  constructed . Input  patterns  are 
then  analyzed  by  the  GECTA  and  classified  based  on  minimum-distance  criterion. 

An  example  of  an  input  pattern  is  shown  in  Fig.  7.1(a).  Assume  the  leftmost 
of  the  top  row  to  be  the  starting  point.  From  the  starting  point,  the  input  pattern 
is  chain  coded  point  by  point.  The  successor  point  is  coded  as  a,  b,...,  or  h ac- 
cording to  its  relative  position  to  the  current  node.  The  resulting  chain  code  is 
shown  in  Fig.  7.1(b).  In  Fig.  7.1(b),  the  point  labeled  as  "I"  is  the  starting 
point.  The  majority  code  in  a continuous  line  segment  consisting  of  eight  coded  points 
is  selected  as  the  primitive  of  that  line  segment.  Primitives  selected  by  this  method 
approximate  the  primitives  assigned  in  Fig.  6.1.  When  a line  is  terminated  or  branched 
before  it  counts  to  eight  points,  the  short  line,  if  longer  than  two  points,  will  be 
considered  as  a normal  line  segment;  thus,  a primitive.  Otherwise,  the  line  is 
neglected.  The  binary  tree  representation  of  pattern  E is  given  in  Fig.  7.1(c). 
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The  leftmost  I of  each  strings  in  the  column  entitled  as  "tree  domain"  corresponds 
to  the  identity  of  U in  Definition  2.1.  The  rest  symbols  in  the  string  are  added 

according  to  Definition  2.2.  Therefore,  I,  11,  12,  111, , etc.  in  Fig.  7.1(c) 

correspond  to  tree  domain  0,  0-1,  0-2,  0-1-1, , etc.  by  Definition  2.2. 
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Fig.  ?• 1 • Input  Format,  Chain  Code  Result,  and  Binary  Tree  Representation 
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Five  characters,  A,  C,  D,  E,  H,  are  used  in  this  experiment.  The  assumed 
sample  patterns  for  each  of  the  five  characters  and  their  tree  representations  are 
shown  in  Fig.  7.2.  Their  respective  tree  grammars  are  given  in  Appendix  B.l. 

A total  of  26  patterns,  casually  or  meticulously  printed  characters,  are  tested. 
Assume  that  a = v = 6 = 1,  the  input  patterns,  their  tree  representations,  as  well  as 
classification  results  and  their  distance  from  the  assigned  sample  pattern  are  given 
in  Appendix  B.2.  The  entire  recognition  scheme  is  programmed  in  Fortran  IV  on  a 
CDC  6500  computer.  The  cpu  time  used  for  chain  coding  and  generating  tree  repre- 
sentation of  each  input  is  given  under  the  title  "Time  used  for  linking  a tree." 

The  actual  recognition  time  is  illustrated  as  "Time  used  for  parsing,"  The  average 
recognition  time  is  *t.l  sec.  per  character. 

These  26  test  patterns  are  also  processed  by  the  tree  automata  constructed  for 
grammars  G^,  G^.,  Gp,  G^,  G^  shown  in  Appendix  B.l.  Evidently,  a pattern  is  acceptable 
only  when  its  distance  from  a sample  pattern  is  zero  (see  Appendix  B.2).  Therefore, 
six  out  of  the  26  test  patterns  are  recognizable.  The  processing  time  is  .012  sec.  per 
character  (average  over  the  six  recognizable  patterns). 

Compared  with  the  non-error-correcting  scheme,  GECTA  has  the  potential  of 
recognizing  highly  distorted  patterns,  such  as  cursive  handwritings.  Due  to  the 
exhaustive  search  method  that  the  error-correcting  parsing  scheme  employs,  the  main 
drawback  is  its  slow  processing  time.  (Some  simple  techniques  we  suggest  in  [18]  can 
be  used  to  reduce  the  processing  time  significantly.) 

7.2  Cone  1 us  ion 

The  first  question  we  posed  at  the  beginning  of  this  section,  the  necessity  of 
using  an  error-correcting  parser  in  pattern  recognition,  can  possibly  be  answered  by 
the  remarks  we  gave  in  Section  k.  When  reconstruction  of  noisy  and/or  distorted 


Fig.  7.2.  Sample  Patterns  of  Character  A,  C,  D,  E,  H 
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patterns  is  required,  or  when  it  is  too  cumbersome  to  infer  a grammar  to  generate 
most  of  the  irregular  patterns,  the  use  of  an  error-cor rec t i ng  parser  seems  to  be 
necessary . 

The  main  purpose  of  Section  1 .\  is  not  to  solve  the  problem  of  hand-printed 
character  recognition.  Rather,  it  demonstrates  a c lass i f icat ion  problem  where  the 
number  of  classes  and  parameters  of  each  class,  i.e.,  grammars,  are  known.  Yet, 
no  learnirg  was  conducted  in  this  example.  That  is,  the  pattern  grammars  were  not 
modified  after  a new  pattern  was  classified.  Due  to  the  inefficiency  of  error- 
correcting  parsers,  we  prefer  to  have  pattern  grammars  to  be  able  to  represent 
most  shape  and/or  size  variations  that  exist  in  the  same  class  of  patterns.  In 
many  cases,  training  data  does  not  provide  sufficient  information.  A feedback 
between  recognition  and  analysis  in  a syntactic  pattern  recognition  system  [12] 
that  provides  updated  information  can  be  conducted  by  adding  an  error-correcting 
parser.  A block  diagram  of  the  feedback  system  is  shown  in  Fig.  7.3. 

A similar  approach  can  also  be  used  in  designing  a decision-directed  learning 
system  where  correct  classi f ications  of  learning  observations  are  unknown.  A flowchart 
of  this  learning  system  is  illustrated  in  Fig.  7.^.  The  error-correcting  syntax 
analyzer  in  such  a system  provides  the  grammatical  inference  machine  the  classifi- 
cations of  Input  samples. 


recogn 

analys 


classi fied 

patterns 


Fig.  7.3.  Block  Diagram  of  Feedback  Pattern  Recognition  System 
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(d)  Character  E 
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Append i x B ■ 2 . Classification  Results  of  26  Test  Patterns 
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